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RAW SEQUENCE LISTING DATE: 04/01/2005 

PATENT APPLICATION: US/10/528,674 TIME: 10:54:06 

Input Set : A:\0185659SeqList.txt 
Output Set: N:\CRF4\04012005\J528674.raw 

3 <110> APPLICANT: DSM IP ASSETS B.V. 

5 <120> TITLE OF INVENTION: Transcriptional activator gene for genes involved in 
cobalamin 

6 biosynthesis 

8 <130> FILE REFERENCE: NDR5233 
C--> 10 <140> CURRENT APPLICATION NUMBER: US/10/528,674 
C--> 11 <141> CURRENT FILING DATE: 2005-03-23 

13 <150> PRIOR APPLICATION NUMBER: EP 02021603.2 

14 <151> PRIOR FILING DATE: 2002-09-27 
16 <160> NUMBER OF SEQ ID NOS: 8 

18 <170> SOFTWARE: PatentIn version 3.2 

20 <210> SEQ ID NO: 1 

21 <211> LENGTH: 1439 

22 <212> TYPE: DNA 

23 <213> ORGANISM: Pseudomonas denitrif icans 

26 <220> FEATURE: 

27 <221> NAME/KEY: CDS 

28 <222> LOCATION: (303 ).. (1304 ) 

30 <400> SEQUENCE: 1 

31 cttttaccga cacccgcacc agccgtgtga agcgtcgcaa tcggcaaaca caaccctgga 60 
33 gagcgctttg cgaatgttgc gtatctctgg cagttcttgt ccgatagcgt caacagcgct 120 
35 gtaaaaactg tcgctttcct tacaaaagcc tgaaagcggg gccgagccgc ctttcaccgc 180 
37 gcgatgtcgc aatgcgaaat ctctttgcgg tcgctgctat ccatgcgaat gtcgcaaaca 240 
39 gacaagccga agagtatctt ccaagcgacg aatataccgc atcgccaagc atgggaagcc 300 
41 tc atg aac aag cca ctg acc aaa aag cgt tct etc gtc ttc ttc ctg 347 



42 


Met Asn Lys Pro Leu Thr Lys Lys Arg Ser Leu Val Phe Phe Leu 




43 


1 






5 








10 








15 




45 


gtg 


ccg 


aac 


ttt 


tec 


atg 


ctg 


ecc 


ttt 


teg 


gcg 


gcg 


ate 


gaa 


aeg 


etc 


395 


46 


Val 


Pro 


Asn 


Phe 


Ser 


Met 


Leu 


Pro 


Phe 


Ser 


Ala 


Ala 


He 


Glu 


Thr 


Leu 




47 










20 










25 










30 






49 


cgc 


ate 


gee 


aac 


cgc 


atg 


etc 


gge 


tae 


gag 


gee 


tat 


tec 


tgg 


cgc 


ete 


443 


50 Arg 


He 


Ala 


Asn 


Arg 


Met 


Leu 


Gly 


Tyr 


Glu 


Ala 


Tyr 


Ser 


Trp 


Arg 


Leu 




51 








35 










40 










45 








53 


gca 


teg 


tee 


gae 


gge 


gaa 


aag 


gte 


ctg 


teg 


teg 


age 


ggt 


ate 


gcg 


ete 


491 


54 


Ala 


Ser 


Ser 


Asp 


Gly 


Glu 


Lys 


Val 


Leu 


Ser 


Ser 


Ser 


Gly 


He 


Ala 


Leu 




55 






50 










55 










60 










57 


gag 


gtc 


aac 


teg 


teg 


ett 


gca 


gae 


gag 


cgc 


aag 


ttt 


etc 


gge 


gge 


gaa 


539 


58 


Glu 


Val 


Asn 


Ser 


Ser 


Leu 


Ala 


Asp 


Glu 


Arg 


Lys 


Phe 


Leu 


Gly 


Gly 


Glu 




59 




65 










70 










75 












61 


aac 


cgc 


cec 


teg 


atg 


gtg 


ctg 


gtc 


tgt 


tec 


gge 


ate 


tat 


gtc 


gag 


gae 


587 


62 


Asn 


Arg 


Pro 


Ser 


Met 


Val 


Leu 


Val 


Cys 


Ser 


Gly 


He 


Tyr 


Val 


Glu 


Asp 




63 


80 










85 










90 










95 




65 


ttc 


aac 


aac 


aag 


teg 


gtc 


aat 


gee 


tgg 


ctg 


cgc 


gag 


gtc 


tae 


aat 


cgc 


635 


66 


Phe 


Asn 


Asn 


Lys 


Ser 


Val 


Asn 


Ala 


Trp 


Leu 


Arg 


Glu 


Val 


Tyr 


Asn 


Arg 





file://C:\CRF4\Outhold\VsrJ528674.htm 



4/1/05 



Page 2 of 6 



RAW SEQUENCE LISTING DATE: 04/01/2005 

PATENT APPLICATION: US/10/528,674 TIME: 10:54:06 



Input Set : A:\0185659SeqLi8t.txt 
Output Set: N:\CRF4\04012005\J528674.raw 



67 






100 










105 










110 






69 


ggc gtc gcc 


gtc 


ggc 


age 


etc 


tgt 


acc 


ggc 


gcc 


cat 


gtg 


ctg 


gcg 


teg 


683 


70 


Gly Val Ala 


val 


Gly 


Ser 


Leu 


Cys 


Thr 


Gly 


Ala 


His 


Val 


Leu 


Ala 


Ser 




71 




115 










120 










125 








73 


gcc ggt ctt 


ctg 


acc 


ggc 


aag 


cgc 


tgc 


gcc 


ate 


cac 


tgg 


gaa 


aac 


ctg 


731 


74 


Ala Gly Leu 


Leu 


Thr 


Gly 


Lys 


Arg 


Cys 


Ala 


He 


His 


Trp 


Glu 


Asn 


Leu 




75 


130 










135 










140 










77 


ccg ggc ttt 


tec 


gaa 


age 


ttc 


ccg 


cag 


gtc 


gac 


gtc 


tat 


gcc 


gac 


etc 


779 


78 


Pro Gly Phe 


Ser 


Glu 


Ser 


Phe 


Pro 


Gin 


Val 


Asp 


Val 


Tyr 


Ala 


Asp 


Leu 




79 


145 








150 










155 












81 


tac gaa ate 


gac 


age 


aac 


ate 


tac 


acc 


tgc 


gee 


ggc 


ggc 


ace 


gee 


teg 


827 


82 


Tyr Glu lie 


Asp 


Ser 


Asn 


He 


Tyr 


Thr 


Cys 


Ala 


Gly 


Gly 


Thr 


Ala 


Ser 




83 


160 






165 










170 










175 




85 


etc gac atg 


atg 


ctg 


aac 


ctg 


ate 


gac 


cag 


gat 


ttc 


ggc 


gag 


age 


etc 


875 


86 


Leu Asp Met 


Met 


Leu 


Asn 


Leu 


He 


Asp 


Gin 


Asp 


Phe 


Gly 


Glu 


Ser 


Leu 




87 






180 










185 










190 






89 


gtc aac egc 


gtc 


tgc 


gaa 


cag 


gcg 


ctg 


acc 


gat 


cgc 


gtg 


cgc 


ggg 


cec 


923 


90 


Val Asn Arg 


Val 


Cys 


Glu 


Gin 


Ala 


Leu 


Thr 


Asp 


Arg 


Val 


Arg 


Gly 


Pro 




91 




195 










200 










205 








93 


cat gac cgc 


cag 


cgc 


ctg 


ccg 


ctg 


cgc 


gcc 


cgt 


etc 


ggc 


gtg 


cag 


aac 


971 


94 


His Asp Arg 


Gin 


Arg 


Leu 


Pro 


Leu 


Arg 


Ala 


Arg 


Leu 


Gly 


Val 


Gin 


Asn 




95 


210 










215 










220 










97 


gee aag gtg 


ctg 


tec 


ate 


ate 


gaa 


ctg 


atg 


gag 


gca 


aac 


etc 


gee 


gag 


1019 


98 


Ala Lys Val 


Leu 


Ser 


He 


He 


Glu 


Leu 


Met 


Glu 


Ala 


Asn 


Leu 


Ala 


Glu 




99 


225 








230 










235 













101 


ccg 


ctt 


teg 


ctg 


etc 


gaa 


ate 


gcc 


gag 


ggc 


gee gat 


etc 


tec 


cgc 


cgc 


1067 


102 


Pro 


Leu 


Ser 


Leu 


Leu 


Glu 


He 


Ala 


Glu 


Gly 


Ala Asp 


Leu 


Ser 


Arg 


Arg 




103 


240 










245 










250 








255 




105 


cag 


ate 


gag 


cgc 


etc 


ttc 


cgc 


cag 


gaa 


atg 


ggc cgc 


teg 


cct 


gca 


cgc 


1115 


106 


Gin 


He 


Glu 


Arg 


Leu 


Phe 


Arg 


Gin 


Glu 


Met 


Gly Arg 


Ser 


Pro 


Ala 


Arg 




107 










260 










265 








270 






109 


tac 


tat 


etc 


gaa 


ate 


cgc 


etc 


gat 


cgc 


gca 


agg cac 


etc 


ttg 


ate 


cag 


1163 


110 


Tyr 


Tyr 


Leu 


Glu 


He 


Arg 


Leu 


Asp 


Arg 


Ala 


Arg His 


Leu 


Leu 


He 


Gin 




111 








275 










280 








285 








113 


teg 


teg 


atg 


ccg 


gtg 


gtc 


gaa 


gtg 


gee 


gta 


gee tgc 


ggc 


ttc 


gtc 


tec 


1211 


114 


Ser 


Ser 


Met 


Pro 


Val 


Val 


Glu 


Val 


Ala 


Val 


Ala Cys 


Gly 


Phe 


Val 


Ser 




115 






290 










295 








300 










117 


gee 


teg 


eac 


ttc 


tec 


aag 


tgt 


tat 


cgc 


gaa 


etc tac 


aac 


cgc 


teg 


ccg 


1259 


118 


Ala 


Ser 


His 


Phe 


Ser 


Lys 


Cys 


Tyr 


Arg 


Glu 


Leu Tyr 


Asn 


Arg 


Ser 


Pro 




119 




305 










310 








315 












121 


cag 


cag 


gag 


egc 


gee 


gac 


cgc 


aag 


ctg 


aeg 


ctg cag 


atg 


gcg 


ega 




1304 


122 


Gin 


Gin 


Glu 


Arg 


Ala 


Asp 


Arg 


Lys 


Leu 


Thr 


Leu Gin 


Met 


Ala 


Arg 






123 


320 










325 










330 













125 taagcggcag atcagatgga teagacaagg eggagcttct cegecetttt tcgttggtga 1364 

127 cactttccgc ttgtgccgte ctggcgettg eacgegcege tgttttggat tgaatggteg 1424 

129 gegttteagg agage 1439 

132 <210> SEQ ID NO: 2 

133 <211> LENGTH: 334 

134 <212> TYPE: PRT 
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135 


<213> ORGANISM: 


Pseudomonas 


denitrif icans 












137 


<400> SEQUENCE: 


2 
























139 


Met 


Asn 


Lys 


Pro 


Leu 


Thr 


Lys 


Lys 


Arg 


Ser 


Leu 


Val 


Phe 


Phe 


Leu 


Val 


140 


1 








5 










10 










15 




143 


Pro 


Asn 


Phe 


Ser 


Met 


Leu 


Pro 


Phe 


Ser 


Ala 


Ala 


He 


Glu 


Thr 


Leu 


Arg 


144 








20 










25 










30 






147 


He 


Ala 


Asn 


Arg 


Met 


Leu 


Gly 


Tyr 


Glu 


Ala 


Tyr 


Ser 


Trp 


Arg 


Leu 


Ala 


148 






35 










40 










45 








151 


Ser 


Ser 


Asp 


Gly 


Glu 


Lys 


Val 


Leu 


Ser 


Ser 


Ser 


Gly 


He 


Ala 


Leu 


Glu 


152 




50 










55 










60 










155 


Val 


Asn 


Ser 


Ser 


Leu 


Ala 


Asp 


Glu 


Arg 


Lys 


Phe 


Leu 


Gly 


Gly 


Glu 


Asn 


156 


65 










70 










75 










80 


159 


Arg 


Pro 


Ser 


Met 


Val 


Leu 


Val 


Cys 


Ser 


Gly 


He 


Tyr 


Val 


Glu 


Asp 


Phe 


160 










85 










90 










95 




163 


Asn 


Asn 


Lys 


Ser 


Val 


Asn 


Ala 


Trp 


Leu 


Arg 


Glu 


Val 


Tyr 


Asn 


Arg 


Gly 


164 








100 










105 










110 






167 


Val 


Ala 


Val 


Gly 


Ser 


Leu 


Cys 


Thr 


Gly 


Ala 


His 


Val 


Leu 


Ala 


Ser 


Ala 


168 






115 










120 










125 








171 


Gly 


Leu 


Leu 


Thr 


Gly 


Lys 


Arg 


Cys 


Ala 


He 


His 


Trp 


Glu 


Asn 


Leu 


Pro 


172 




130 










135 










140 










175 


Gly 


Phe 


Ser 


Glu 


Ser 


Phe 


Pro 


Gin 


Val 


Asp 


Val 


Tyr 


Ala 


Asp 


Leu 


Tyr 


176 


145 










150 










155 










160 


179 


Glu 


He 


Asp 


Ser 


Asn 


He 


Tyr 


Thr 


Cys 


Ala 


Gly 


Gly 


Thr 


Ala 


Ser 


Leu 


180 










165 










170 










175 




183 


Asp 


Met 


Met 


Leu 


Asn 


Leu 


He 


Asp 


Gin 


Asp 


Phe 


Gly 


Glu 


Ser 


Leu 


Val 


184 








180 










185 










190 






187 


Asn 


Arg 


Val 


Cys 


Glu 


Gin 


Ala 


Leu 


Thr 


Asp 


Arg 


Val 


Arg 


Gly 


Pro 


His 


188 






195 










200 










205 








191 


Asp 


Arg 


^Gln 


Arg 


Leu 


Pro 


Leu 


Arg 


Ala 


Arg 


Leu 


Gly 


Val 


Gin 


Asn 


Ala 


192 




210 










215 










220 










195 


Lys 


Val 


Leu 


Ser 


He 


He 


Glu 


Leu 


Met 


Glu 


Ala 


Asn 


Leu 


Ala 


Glu 


Pro 


196 


225 










230 










235 










240 


199 


Leu 


Ser 


Leu 


Leu 


Glu 


He 


Ala 


Glu 


Gly 


Ala 


Asp 


Leu 


Ser 


Arg 


Arg 


Gin 


200 










245 










250 










255 




203 


He 


Glu 


Arg 


Leu 


Phe 


Arg 


Gin 


Glu 


Met 


Gly 


Arg 


Ser 


Pro 


Ala 


Arg 


Tyr 


204 








260 










265 










270 






207 


Tyr 


Leu 


Glu 


He 


Arg 


Leu 


Asp 


Arg 


Ala 


Arg 


His 


Leu 


Leu 


He 


Gin 


Ser 


208 






275 










280 










285 








211 


Ser 


Met 


Pro 


Val 


Val 


Glu 


Val 


Ala 


Val 


Ala 


Cys 


Gly 


Phe 


Val 


Ser 


Ala 


212 




290 










295 










300 










215 


Ser 


His 


Phe 


Ser 


Lys 


Cys 


Tyr 


Arg 


Glu 


Leu 


Tyr 


Asn 


Arg 


Ser 


Pro 


Gin 


216 


305 










310 










315 










320 


219 


Gin 


Glu 


Arg 


Ala 


Asp 


Arg 


Lys 


Leu 


Thr 


Leu 


Gin 


Met 


Ala 


Arg 






220 










325 










330 














223 


<210> SEQ ID NO: 


: 3 
























224 


<211> LENGTH: 20 
























225 


<212.> TYPE: 


DNA 


























226 


<213> ORGANISM: 


Artificial Sequence 
















228 


<220> FEATURE: 
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229 <223> OTHER INFORMATION: Primer N-GUS 

231 <400> SEQUENCE: 3 

232 ccaggaatag gcctcgtagc 20 

235 <210> SEQ ID NO: 4 

236 <211> LENGTH: 31 

237 <212> TYPE: DNA 

238 <213> ORGANISM: Artificial Sequence 

240 <220> FEATURE: 

241 <223> OTHER INFORMATION: Primer C-GUS 

243 <400> SEQUENCE: 4 

244 cgcagagctc gagccaccga ggctgtagcc g 31 

247 <210> SEQ ID NO: 5 

248 <211> LENGTH: 32 

249 <212> TYPE: DNA 

250 <213> ORGANISM: Artificial Sequence 

252 <220> FEATURE: 

253 <223> OTHER INFORMATION: oligonucleotide 1 for cassette 

255 <400> SEQUENCE: 5 

256 ggatcctgac taactagcat ctggaggaca ca 32 

259 <210> SEQ ID NO: 6 

260 <211> LENGTH: 38 

261 <212> TYPE: DNA 

262 <213> ORGANISM: Artificial Sec[uence 

264 <220> FEATURE: 

265 <223> OTHER INFORMATION: oligonucleotide 2 for cassette 

267 <400> SEQUENCE: 6 

268 tatgtgtcct ccagatgcta gttagtcagg atcctgca 38 

271 <210> SEQ ID NO: 7 

272 <211> LENGTH: 25 

273 <212> TYPE: DNA 

274 <213> ORGANISM: Artificial Sequence 

276 <220> FEATURE: 

277 <223> OTHER INFORMATION: Primer COBRNde 

279 <400> SEQUENCE: 7 

280 ggcatatgaa caagccactg accaa 25 

283 <210> SEQ ID NO: 8 

284 <211> LENGTH: 34 

285 <212> TYPE: DNA 

286 <213> ORGANISM: Artificial Sequence 

288 <220> FEATURE: 

289 <223> OTHER INFORMATION: Primer COBR135R 
2 91 <400> SEQUENCE: 8 

292 gtcacccggg catatgttac gtcctgtaga aacc 34 
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; ' ' " PATENT APPLICATION: US/10/528,674 TIME: 10:54:07 
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